Exploiting Sublanguage and Domain Characteristics in a Bootstrapping Approach to Lexicon and Ontology Creation
نویسندگان
چکیده
It is very costly to build up lexical resources and domain ontologies. Especially when confronted with a new application domain lexical gaps and a poor coverage of domain concepts are a problem for the successful exploitation of natural language document analysis systems that need and exploit such knowledge sources. In this paper we report about ongoing experiments with ‘bootstrapping techniques’ for lexicon and ontology creation.
منابع مشابه
WordNet-Inspired Terminological Resources for Bio-NLP
WordNet is currently the most widely used lexicon resource for general English language. We here argue in favor of a similar lexical resource for biomedicine, BioWordNet, to extend the virtues of WordNet to this sublanguage domain. We present a simple approach to semi-automatically build up such a resource. It crucially builds on the conversion of structured domain knowledge taken from the Open...
متن کاملBootstrapping Biomedical Ontologies for Scientific Text using NELL
We describe an open information extraction system for biomedical text based on NELL (the Never-Ending Language Learner) (Carlson et al., 2010), a system designed for extraction from Web text. NELL uses a coupled semi-supervised bootstrapping approach to learn new facts from text, given an initial ontology and a small number of “seeds” for each ontology category. In contrast to previous applicat...
متن کاملar X iv : c s . A I / 05 01 09 5 v 1 31 J an 2 00 5 Context - related Derivation of Word Senses
Real applications of natural language document processing are very often confronted with domain specific lexical gaps during the analysis of documents of a new domain. This paper describes an approach for the derivation of domain specific concepts for the extension of an existing ontology. As resources, we need an initial ontology and a partially processed corpus of a domain. We exploit the spe...
متن کاملContext Related Derivation of Word Senses
Real applications of natural language document processing are very often confronted with domain specific lexical gaps during the analysis of documents of a new domain. This paper describes an approach for the derivation of domain specific concepts for the extension of an existing ontology. As resources, we need an initial ontology and a partially processed corpus of a domain. We exploit the spe...
متن کاملطراحی سامانه نیمهخودکار ساخت هستیشناسی بهکمک تحلیل همرخدادی واژگان و روش C-value (مطالعه موردی: حوزه علمسنجی ایران)
Ontology is one of formal concepts and the relations in the specific regions.It have recently tried to design the learning, automatic methods of Ontology. Whereas Ontology containing concepts and the relations, exploiting concepts, the semantic relations among concept. The various Ontology of regions and different applications are expensive processes that are automatic.The lack of main knowledg...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره cs.CL/0304035 شماره
صفحات -
تاریخ انتشار 2002